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fS (54) Title: METHODS AND MEANS OF HI STONE METHYLATION 

(57) Abstract: This invention relates to the methyl ation of hi stones, in particular to a previously uncharacterised group of histone 
^ H3 methylases which comprise a SET domain and which methylate either lysine 4 within the amino tail of histone H3 or within the 
histone H3 core. Methylation by these methylases, in particular trimethylation, is shown to be important for transcriptional activity. 
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Methods and Means of Histone Methylation 

This invention relates to histone methylation, in 
particular, to the methylation of specific lysine 
residues on histone H3 and to the biological effects of 
such methylation. 

DNA in the eukaryotic nucleus is wrapped around a 
histone-core which is a protein complex involving the 
four histones H4, H3, H2B and H2A. This DNA-histone 
structure (nucleosome) is not compatible with gene 
expression. Re-organisation of the nucleosome is 
required for transcription factors and RNA polymerase to 
have access to the DNA for transcription. 

Covalent post-translational modifications of the amino- 
terminal tails of histones regulate the transcriptional 
*on' or *off ' states of chromatin and influence 
chromosome condensation and segregation. 

Such modifications include acetylation, phosphorylation 
and methylation. 

Suv39Hl is a methyl transferase which is specific for 
lysine 9 of histone H3 (Rea et al (2000) Nature 4 06 593- 
598) . Methylation of lysine 9 by Suv39Hl leads to the 
recruitment of the HP1 repressor protein (Bannister et al 
(2001) Nature 410 120-124, Lachner et al (2001) 410 116- 
12 0) and the formation of transcriptionally silent 
heterochromatin . 

Suv3 9Hl has homologues in human (Suv3 9H2) , Drosophila 
(su(var)3-9) and S. pombe (clr4) . Defects in Suv39Hl and 
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its homologues have been correlated with defects in 
transcription and aberrant mitotic division and 
chromosomal mis -segregation . 

Expression of the Suv3 9hl protein has also been shown to 
inhibit cell proliferation of mammalian cells and its 
phosphorylation status is known to change as cells 
progress from Gl to S. 

The Suv3 9hl methyltransf erase comprises an evolutionarily 
conserved domain called a SET domain (SUV39 Enhancer of 
Zest and Tri thorax) . However, SET domains are also found 
in other proteins which do not have histbne methylase 
activity (Rea et al . (2000)}. 

A histone methylase activity has also been identified in 
yeast Clr4 this activity has been shown to be essential 
for transcriptional repression (Bannister et al (2001) 
Nature 410 120-124) . Lack of Clx4 methylase activity 
activates heterochromatically repressed genes at 
centromeres and may lead to chromosome mis-segregation. 

The present inventors have discovered that a group of 
previously uncharacterised SET domain containing proteins 
are histone H3 methylases. This group includes S. 
cerevisiae SET1 (YHR119W) and its human homologues 
hSETl.lP (KIAA0339) and hSET1.2P (KIAA0339) and S. 
cerevisiae SET2 (YJL168C) and its human homologues hSET2 
(AJ238403) and WHSC1 (XP_003388) . 

The present inventors have further discovered that these 
methylases have activities which are distinct from 
Suv39hl and other previously characterised histone 
methylases. SET1 and related proteins specifically 



# 



WO 02/090578 



PCT/GB02/02050 



3 



methylate lysine 4 of histone. H3 while SET2 and related 
proteins specifically methylate a lysine residue outside 
the amino tail region of histone H3 (i.e. a core lysne 
residue) . 

Various aspects of the present invention provide for the 
use of a SET polypeptide and a histone H3 polypeptide in 
screening methods and assays for agents which modulate 
methylation of histone H3 by the SET polypeptide, and 
which may therefore be useful in the modulating cellular 
proliferation, for example in treating conditions such as 
cancer. 

In a general aspect, the present invention provides an 
assay method for an agent with ability to modulate, e.g. 
disrupt, interfere with, or increase interaction and/or 
binding of histone H3 with a SET polypeptide, the method 
including: 

(a) bringing into contact a SET polypeptide and a 
histone H3 polypeptide; and, 

(b) determining binding and/or interaction between the 
histone H3 polypeptide and the SET polypeptide. 

An assay may be carried out in the presence of a test 
compound under conditions in which, in the absence of the 
test compound, the SET polypeptide will interact or bind 
with histone H3 . 

The precise format of the assay of the invention may be 
varied by those of skill in the art using routine skill 
and knowledge. For example, the interaction between the 
polypeptides may be studied in vitro by labelling one 
with a detectable label and bringing it into contact with 
the other which has been immobilised on a solid support. 
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Suitable detectable labels include 35 S -methionine which 
may be incorporated into recombinantly produced peptides 
and polypeptides, Recombinantly produced peptides and 
polypeptides may also be expressed as a fusion protein 
containing an epitope which can be labelled with an 
antibody. 

Fusion proteins may be generated that incorporate six 
histidine residues at either the N-terminus or C-terminus 
of the recombinant protein. Such a histidine tag may be 
used for purification of the protein by using 
commercially available columns which contain a metal ion, 
either nickel or cobalt (Clontech, Palo Alto, CA, USA) . 
These tags also serve for detecting the protein using 
commercially available monoclonal antibodies directed 
against the six histidine residues (Clontech, Palo Alto, 
CA, USA) . 

A protein may be immobilized on a solid support using an 
antibody against that protein bound to a solid support or 
via other technologies which are known per se. A 
preferred in' vitro interaction may utilise a fusion 
protein including glutathione-S-transf erase (GST) . This 
may be immobilized on glutathione agarose beads. In an 
in vitro assay format of the type described above, a test 
compound can be assayed by determining its ability to 
diminish the amount of labelled peptide or polypeptide 
which binds to the immobilized GST- fusion polypeptide. 
This may be determined by fractionating the glutathione- 
agarose beads by SDS-polyacrylamide gel electrophoresis. 
Alternatively, the beads may be rinsed to remove unbound 
protein and the amount of protein which has bound can be 
determined by counting the amount of label present in, 
for example, a suitable scintillation counter. . 
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An assay according to the present invention may also take 
the form of an in vivo assay. The in vivo assay may be 
performed in a cell line such as a yeast strain in which 
the relevant polypeptides or peptides are expressed from 
one or more vectors introduced into the cell. 

Various methods and uses of modulators which inhibit, 
potentiate, increase or stimulate methylation of histone 
H3 by a SET polypeptide are provided as further aspects 
of the present invention. 

The purpose of disruption, interference with or 
modulation of the methylation of histone H3 by a SET 
polypeptide may be to modulate cellular functions such as 
transcription and proliferation which are mediated by 
virtue of such methylation, such as transcription, as 
discussed above and further below. 

A method of screening for a substance which modulates 
activity of a SET polypeptide may include contacting one 
or more test substances or agents with the SET 
polypeptide in a suitable, reaction medium, testing the 
activity of the treated polypeptide and comparing that 
activity with the activity of the polypeptide in 
comparable reaction medium untreated with the test 
substance, substances or agent. A difference in activity 
between the treated and untreated SET polypeptides' is 
indicative of a modulating effect of the relevant test 
substance or substances. 

Thus another aspect of the present invention provides an 
assay method for an agent with ability to modulate, e.g. 
disrupt, interfere with, increase or stimulate the 
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histone H3 methylase activity of a SET polypeptide, the 
method including: 

(a) bringing into contact a SET polypeptide and a test 
compound ; and , 

(b) determining the histone H3 methylase activity of the 
SET polypeptide. 

An assay may be carried- out under conditions in which, in 
the absence of the test compound, the SET polypeptide 
will possess histone H3 methylase activity. Histone H3 
methylase activity may be determined as described herein. 

The SET polypeptide and the test compound may, for 
example, be brought into contact in the presence of a 
histone H3 polypeptide. Histone H3 methylase activity may 
then be determined by determining the presence of one or 
more methyl groups at a lysine residue of the histone H3 
polypeptide. 

In another aspect, the present invention provides a 
method for identifying and/or obtaining an agent with 
ability to modulate, e.g. disrupt, inhibit, interfere 
with, increase or stimulate methylation of histone H3 by 
a SET polypeptide, the method including: 

(a) bringing into contact a SET polypeptide and a 
histone H3 polypeptide in the presence of a test 

c ompound ; and , 

(b) determining methylation of the histone H3 
polypeptide. 

An assay may be carried out under conditions in which, in 
the absence of the test compound, the SET polypeptide 
will methylate histone. H3. 
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Determining methylation of the histone H3 polypeptide may 
comprise determining the presence or absence of one or 
more methyl groups on a lysine residue of said histone H3 
polypeptide, for example lysine 4 or a core lysine 
5 residue . 



.Where the SET polypeptide is a SET1 polypeptide, 
methylation of the lysine 4 residue of histone H3 may be 
determined. Where the SET polypeptide is a SET2 

10 polypeptide, methylation of a lysine residue selected 

from the group consisting of lysine 27, lysine 36, lysine 
37, lysine 42, lysine 56, lysine 64, lysine 79, lysine 
115, lysine 121, lysine 122 and lysine 125 within the 
core of histone H3 may be determined. More preferably, 

15 where the SET polypeptide is a SET2 polypeptide, 
methylation of lysine 36 may be determined. 

A test compound which increases, potentiates, stimulates, 
disrupts, reduces, interferes with or wholly or partially 
20 abolishes methylation of the histone H3 polypeptide and- 
which may thereby modulate SET polypeptide activity, may 
thus be identified and/or obtained. 

Methylation may be determined according to known methods 
25 described herein. 

Agents which increase or potentiate methylation of 
histone H3 may be identified using conditions which, in 
the absence of a positively- testing agent, prevent 
30 methylation. Such agents may be used to potentiate the 

function of a SET polypeptide and may have an effect, for 
example, on transcription and/or DNA replication. 
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In methods of the present invention, a histone H3 
polypeptide may brought into contact with a SET 
polypeptide in the presence of a suitable substrate such 
as SAM (S-adenosyl- (methyl) -L-methionine) . The substrate 
5 maybe labelled, e.g. (S-adenosyl- (methyl- 14 C) -L- 

methionine) and the amount of label on the histone H3 
after incubation with SET polypeptide determined. 

Methods described herein may be useful in determining the 
10 presence of, and optionally quantifying, the amount of 
SET polypeptide in a test sample. This may have a 
diagnostic or prognostic purpose, e.g. in the diagnosis 
or prognosis of any medical condition discussed herein . 
(e;g. a proliferative disorder such as cancer) or in the 
evaluation of a therapy to treat such a condition. 

The characterisation of the histone H3 methylase activity 
of a SET polypeptide as described herein and its role in 
the regulation of cellular proliferation and 
transcriptional activity allows the use of materials and 
methods, such as are disclosed and discussed above, for 
establishing the presence or absence in a test sample, 
for example, obtained from an individual, of aberrant, 
i.e. increased, reduced or abolished SET polypeptide 
histone H3 methylase activity. Such aberrant activity may 
be determined as described herein for the purpose of 
diagnosing a predisposition of an individual to a 
condition associated with cellular proliferation or for 
diagnosing an individual as suffering from a condition 
associated with cellular proliferation, such as cancer. 
The presence of aberrant SET polypeptide histone H3 
methylase- activity being indicative of the individual 
having a condition associated with cellular proliferation 
or being predisposed i.e. having an increased 
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susceptibility to a condition associated with cellular 
proliferation. 

Aberrant expression may be detected at the protein level, 
5 by determining the histone H3 methylase activity of a SET 
polypeptide, as described herein, for example, the 
presence or absence or amount of methylase activity or at 
the nucleic acid level (i.e. DNA or RNA) , by determining 
the presence of a mutant, variant or allele of a SET gene 

10 which encodes a SET polypeptide which has aberrant • 
activity or which expresses aberrant i.e. abolished, 
reduced or increased levels of SET polypeptide. The 
■presence or amount of SET polypeptide expression may be 
determined by determining the presence and/or amount of 

15 mRNA encoding the polypeptide. 

In assay methods according to the invention, the amount 
of test substance or compound which may be added to an 
assay of the invention will normally be determined by 

20 trial and error depending upon the type of compound used. 
Typically, from about 0.001 nM to ImM or more 
concentrations of putative inhibitor- compound may be 
used, for example from 0.01 nM to 100pM,'e.g. 0.1 to 50 
yM, such as about 10 pM. Greater concentrations may be 

25 used when a peptide is the test substance. Even a 

molecule which has a weak effect may be a useful lead 
compound for further investigation and development . 

A test substance, compound or agent used .in an assay 
30 method as described herein may be comprised in a sample, 
mixture or extract, for example, a biological sample. 

A screening or assay method may include purifying and/or 
isolating a test substance and/or substance of interest 
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from a mixture or extract, i.e. reducing the content of 
at least one component of the mixture or extract, e.g. a 
component with which the test substance is naturally 
associated. The screening or assay method may include 
5 determining the ability of one or more fractions of a 
test mixture or extract to modulate the methylase 
activity of the SET polypeptide. 

The purification and/or isolation may employ any method 
10 known to those skilled in the art. 

The precise format of any of the screening or assay 
methods of the present invention may be varied by those 
of skill in the art using routine skill and knowledge. 
15 The skilled person is well aware of the need to employ 
appropriate control experiments . 

Compounds which may be screened using* the assay methods 
described herein may be natural or synthetic chemical 
20 compounds used in drug screening programmes. Extracts of 
plants, microbes or other organisms, which contain 
several characterised or uncharacterised components may 
also be used. 

25 Combinatorial library technology provides an efficient 
way of testing a potentially vast number of different 
substances for ability to modulate an interaction. Such 
libraries and their use are known in the art, for all 
manner of natural products, small molecules and peptides, 

30 among others. The use of peptide libraries may be 
preferred in certain circumstances. 

Methods of determining the methylation of histone H3 by a 
SET polypeptide and of screening for an agent able to 
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modulate the methylation of hi stone H3 by a SET 
polypeptide, include methods in which a suitable end- 
point is used to assess interaction. 

5 Where the SET polypeptide is a SET1 polypeptide, suitable 
end points include the determination of the methylation 
of the lysine 4 residue of histone H3 using methods as 
described herein. 

10 Where the SET polypeptide is a SET2 polypeptide, suitable 
end points include the determination of the methylation 
of histone H3 outside the N terminal 26 amino acids (i.e. 
at a core lysine residue) using methods as described 
herein, for example methylation of lysine 36. 

15 

Methylation may be determined by any convenient method 
known to a skilled person. For example, histone H3 
polypeptide or variant or derivative thereof, may be 
immobilised e.g. on a bead or plate, and methylation of 

20 the appropriate residue detected using an antibody or 

other binding molecule which binds the N terminal region 
of histone H3 with a different affinity when the residue 
is methylated from when residue is not methylated. 
Antibodies may also be used to determine the level of 

25 methylation e.g. di- or tri-methylation, of a histone H3 
lysine residue. Antibodies may be obtained by means of 
any standard technique as discussed elsewhere herein, 
e.g. using a methylated peptide (such as an N terminal 
fragment of histone H3 or a fragment with the N terminal 

30 amino acids deleted) . 



Binding of a molecule which discriminates between the 
methylated and non-methylated form of a histone H3 
polypeptide or between different degrees of methylation 
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may be assessed using any technique available to those 
skilled in the art, which may involve determination of 
the presence of a suitable label. 

5 Methylation may also be assayed in solution, e.g. as 
described in Rea et al (2000), Mature, 406: 593-599. 
Briefly, lOvig of free histone substrate (mixture of HI, 
H2, H3, and H4 ; Boehringer Mannheim) is mixed with 300nCi 
S-adenosyl- [methyl - 14 C] -L-methionine (25 uCi ral" 

.10 1 ) (Amersham) as methyl donor in methylase activity buffer 
(50mM Tris.HCl pH8.5, 20mM KCl , 10mM MgCl 2 , lOmM p- 
mercaptoethanol, 250mM sucrose), to give a final volume' 
of 50pl. 10|ig of SET polypeptide preparation is added 
and .the reaction incubated at 37°C for 60 mins . The 

15 reaction products are then resolved by SDS-PAGE and 

viewed following fluorography of the gel. Alternatively, 
following SDS-PAGE, the resolved proteins can be Western 
blotted to a nitrocellulose membrane, which is then dried 
and exposed to film. 

20 

Methylation may also be determined as described below in 
the examples. 

Of course, the person skilled in the art will design any 
25 appropriate control experiments with which to compare 
results obtained in test assays. 

A histone H3 polypeptide may be a full-length histone "H3 
protein from a eukaryotic cell, such as a yeast or a 
30 mammal, for example a human. The term also includes 

fragments of the full-length protein sequence, such as 
fragments which comprise the lysine 4 residue, for 
example, fragments comprising the N terminal residues of 
the full length protein. Other fragments may comprise 
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residues from the core of histone H3 , i.e. outside the 
amino tail region, such as fragments which comprise a 
lysine residue selected from the group consisting of 
lysine 27, lysine 36, lysine 37", lysine 42, lysine 56, 
5 lysine 64, lysine 79, lysine 115, lysine 121, lysine 122 
and lysine 125, for example, a fragment comprising lysine 
36. 

A SET polypeptide may be a full-length SET protein or a 
10 fragment thereof which retains the methylase activity of 
the full length SET protein, . for example a polypeptide 
which comprises the SET domain. 

Examples of SET domains are shown in Figures 1 and 2 . A 
15 SET domain may comprise three conserved motifs; NHSC, 
GEXSY and Erich (Rhea S. et al (2000) Nature 406 593- 
599) . These motifs are indicated in Figure 1 and Figure 
2. 

20 A SET domain of a SET1 polypeptide may further comprise 
conserved motifs I, II, III, IV and V as indicated in 
Figure 1 and Figure 2 . 

A SET polypeptide* may be a member of the SET1 or the SET2 
25 families of histone H3 methylases (HMTs) . 

SET1 polypeptides specifically methylate the lysine 4 
residue of histone H3 and include S. cerevisiae SET1 
(Database Accession Number YHR119W) , S. pombe SET1 
30 (SPCC306.04C) , hSET-lP (Human: KIAA0339) , hSET-2P (Human: 
KIAA1076) ', HRX, TRX2 , KIAA1090, and ALR (all on public 
databases) . SET1 polypeptides may comprise a SET domain 
having greater than 35%, greater than 40%, greater than 
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4 5% or greater than 50% sequence identity with the SET 
domain of S. cerevisiae SET1 . 

An aspect of present invention provides a SET1 
5 polypeptide which methylates lysine 4 of a histone H3 
polypeptide . 

SET2 polypeptides specifically methylate a lysine residue 
selected from the group consisting of lysine 27, lysine' 

10 36, lysine 37, lysine 42, lysine 56, lysine 64, lysine 
79, lysine 115, lysine 121, lysine 122 and lysine 125 
i.e. a 'core lysine of histone H3 outside the N terminal 
26 amino acid tail. More preferably, ' a SET2 polypeptide 
specifically methylates lysine 36. SET2 polypeptides 

15 include S. cerevisiae SET2 (YJL168C) , S. pombe SET2 

(SPAC29B12), hSET2 (AJ238403), WHSC1 (XP_003388), Q9H6H8 ,' 
NSD1, NSD3 and ASH1 (all on public databases) . 

Another aspect of present invention provides a SET2 
20 polypeptide which methylates a core lysine residue of a 
histone H3 polypeptide. 

SET2 polypeptides' may comprise a SET domain having 
greater than 35%, greater than 40%, greater than 45% or 
25 greater than 50% sequence identity with the SET domain of 
S. cerevisiae SET2 . 

A histone H3 or SET polypeptide may. include any suitable 
fragment, variant, derivative, allele or homologue of 
30 histone H3 or the SET polypeptide, which may be employed 
in a method described herein. Suitable fragments, 
variants or derivatives of histone H3 retain the 
biological activity of being methylated by a SET 
polypeptide i.e. they include the appropriate sites of 
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methylation by the SET polypeptide, for example, lysine 4 
or other target residues within the histone H3 core 
(outside the amino tail residues 1 to 26) , for example 
lysine 36, Suitable variants or derivatives of a SET 
5 polypeptide retain the histone H3 methylase activity. 

A polypeptide which is an amino acid sequence variant, 
allele, derivative or mutant of an amino acid sequence 
described herein may comprise an amino acid sequence 

10 which shares greater than about 60% sequence identity 

with the sequence shown, greater than about 7 0%, greater 
than about 80%, greater than about 90% or greater than 
about 95%. The sequence may share greater than about 70% 
similarity, greater than about 80% similarity, greater 

15 than about 90% similarity or" greater than about 95% 

similarity with the amino acid sequence described herein. 

For amino acid "homology" , this may be understood to be 
similarity (according to the established principles of 
20 amino acid similarity, e.g. as determined using the 
algorithm GAP (as described below) or identity. 

Amino acid similarity is generally defined with reference 
to the algorithm GAP (Accelerys, formerly Genetics 

25 Computer Group, Madison, WI) . GAP uses the Needleman and 
Wunsch algorithm to align two complete sequences that 
maximizes the number of matches and minimizes the number 
of gaps. Generally, the default parameters are used, 
with a gap creation penalty = 12 and gap extension 

30 penalty = 4 . Use of GAP may be preferred but other 

algorithms may be used, e.g. BLAST (which uses the method 
of Altschul et al. (1990) J. Mol . Biol. 215: 405-410), 
FASTA (which uses the method of Pearson and Lipman (1988) 
PNAS USA 85: 2444-2448), or the Smith-Waterman algorithm 
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(Smith and Waterman (1981) J. Mol Biol. 147: 195-197), 
generally employing default parameters. 

Similarity allows for "conservative variation", i.e. 
5 • substitution of one hydrophobic residue such as 

isoleucine, valine, leucine or methionine for another, or 
the substitution of one polar residue for another, such 
as arginine for lysine, glutamic for aspartic acid, or 
glutamine for asparagine. Particular amino acid sequence 
10 variants may differ from a sequence described herein by 
insertion, addition, substitution or deletion of 1 amino 
acid, 2, 3, 4, 5-10, 10-20 20-30, 30-50,. 50-100, 100-150, 
or more than 150 amino acids. 

15 Sequence comparison may be made over the full-length of 
the relevant sequence shown herein, or may more 
preferably be over a contiguous sequence of about or 
greater than about 20, 25, 30, 33, 40, 50, 67, 133, 167, 
200, 233, 267, 300, 333, 400 or more amino acids, 

20 compared with the relevant amino acid sequence. 

As described above, peptides which include or consist of 
fragments of a full-length SET protein are encompassed 
within the term % SET polypeptide' as defined herein. 
25 Where the SET polypeptide is a fragment of the full- 
length protein, suitable fragments are generally those 
which retain the methylase activity of the full-length 
polypeptide and are capable of methylating histone H3 , 
except where otherwise stated. 



30 



A further aspect of the present invention provides a 
fragment of a full-length SET polypeptide as described 
herein, said fragment having histone H3 methylase 
activity. 
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Preferably such a fragment comprises the SET methylase 
domain. 

5 A suitable fragment preferably methylates histone H3 

outside lysine 9, more preferably at lysine 4 or a lysine 
residue selected from the group consisting of lysine 27, 
lysine 36, lysine 37, lysine 42, lysine 56, lysine 64, 
lysine 79, lysine 115, lysine 121, lysine 122 and lysine 
10 125 i.e. a core lysine residue of histone H3 located 

outside the first 26 N terminal amino acids, for example, 
the central and C terminal regions, such as lysine 36. 

A fragment of a SET polypeptide includes a fragment of a 
15 SET1 polypeptide, which may be any member of the SET1 
family, including hSET-lP (human KIAA0339) and hSET-2P 
(human KIAA1076) and a fragment of a SET2 polypeptide, 
which may be any member of the SET2. protein family, 
including hSET2-l, hSET2-2, hSET2-3 and WHSC1 . 

20 

Another aspect of the present invention provides an 
isolated nucleic acid molecule comprising a nucleotide 
sequence which encodes a SET polypeptide. A SET 
polypeptide may include a fragment of a full-length SET 
25 polypeptide sequence described herein. 

Such a nucleotide sequence may be operably linked to a 
heterogeneous regulatory element, such as a promoter or 
enhancer element.. A nucleotide sequence encoding a SET 
30 polypeptide as described above may be comprised within a 
vector, in particular an expression vector. 

Vectors comprising nucleic acid encoding a SET 
polypeptide may be transformed into a suitable host cell 
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as described above to provide for expression of the SET 
polypeptide. Thus, in a further aspect the invention 
provides a process for preparing a SET polypeptide which 
includes cultivating a host cell transformed or 
5 transfected with an expression vector as described above 
under conditions to provide for expression by the vector 
of a coding sequence encoding the SET polypeptide, .and 
recovering the expressed polypeptide. Polypeptides may 
also be expressed in in vitro systems, such as 
10 reticulocyte lysate . 

Following production of a SET polypeptide, it may be 
tested for histone H3 methytransf erase activity, e.g. by 
determination of methylation of lysine 4 or a core lysine 
residue of histone H3 • selected from the group consisting 
of lysine 27, lysine 36, lysine 37, lysine 42, lysine 56, 
lysine 64, lysine 79, lysine 115, lysine 121, lysine 122 
and lysine 125, preferably lysine 36 as described herein 
on incubation of the polypeptide with a histone H3 
polypeptide. 

Thus the present invention also provides a method of 
producing a SET polypeptide comprising; 

expressing a SET polypeptide from encoding nucleic 
25 acid, and; 

determining the methyl transferase activity of the 
expressed SET polypeptide. 

The raethyltransferase activity may include the 
30 methylation of a histone H3 residue other than lysine 9, 
for example lysine 4 or a lysine residue selected from 
the group consisting of lysine 27, lysine 36, lysine 37, 
lysine 42, lysine 56, lysine 64, lysine 79, lysine 115, 
lysine 121, lysine 122 and lysine 125 i.e. a core lysine 



15 
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residue of hi stone H3 located outside the first 26 N 
terminal amino acids, for example lysine 36. 
Methyl trans f erase activity may also include the di- or 
. tri- methylation of such a residue. 

5 

It is not necessary to use the entire proteins for assays 
of the invention which te^st for binding between two 
molecules. Fragments may be generated and used in any 
suitable way known to those of skill in the art. 

10 ■ Suitable ways of generating fragments include, but are 

not limited to, recombinant expression of a fragment from 
encoding DNA. Such fragments may be generated by taking 
encoding DNA, identifying suitable restriction" enzyme 
recognition sites either side of the portion to be 

15 expressed, and cutting out said portion from the DNA. 
The portion may then be operably linked to a suitable 
promoter in a standard commercially available expression 
system. Another recombinant approach is to amplify the 
relevant portion of the DNA with suitable PCR primers. 

20 Small fragments (e.g. up to about 20 or 30 amino acids) 
may also be generated using peptide synthesis methods • 
which are well known in the. art. 

The skilled person can use the techniques described 
25 herein and others well known in the art to produce large 
amounts of peptides, for instance by expression from 
encoding nucleic acid. 

Methods of obtaining agents able to modulate the histone 
30 H3 methylation activity of SET polypeptides include 

methods wherein a suitable end-point is used to assess 
interaction in the presence and absence of a test 
substance. 
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For methylation assays, a histone H3 or SET polypeptide 
which is a full-length protein, truncated portion, or a 
portion of fused to another protein (e.g. GST), or a 
suitable variant or derivative of any of these, may be 
5 used. 

Peptide methylation assays may use peptides that comprise 
the region of histone H3 which is methylated. For a SET1 
polypeptide, this may include the N- terminal region of 
10 histone H3 comprising lysine 4. 

For a SET2 polypeptide, this may include a lysine residue 
selected from the group consisting of lysine 27, lysine 
36, lysine 37, lysine 42, lysine 56, lysine 64, lysine 
15 79, lysine 115, lysine 121, lysine 122 and lysine 125, 

i.e. a core lysine residue of histone H3 located outside 
the first 26 N terminal amino acids, for example, the 
central and C terminal regions. Preferably, for a SET2 
polypeptide, this may include lysine 36. 

20 

The methylation of histone H3 may be assayed by any of a 
variety of procedures such as discussed below and may be 
adapted to high throughput screening approaches . Of c 
particular interest is the methylation of the lysine 4 
25 residue of histone H3 by hSET-lP. (human KIAA03 39) and/or 
hSET-2P (human KIAA1076) and methylation in the central 
and/or C terminal portions (i.e. the core) of histone H3 
by hSET2 (AJ23 84 03 ) and/or WHSC1 (XP_003388) . 

30 A method of the present invention may comprise 

identifying and/or obtaining a test compound which 
modulates the histone H3 methylase activity of a SET 
polypeptide. 
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Following identification of a test compound which is an 
agent or modulator as described herein, it may be 
investigated further. For example, a compound, substance 
or molecule which tests positive for ability to modulate 
5 methylation of the appropriate residue of hi stone H3 

and/or the methylase activity of a SET polypeptide may be 
isolated, purified and/or manufactured. A method of the 
present invention may thus comprise, isolating and/or 
purifying and/or manufacturing or synthesising the test 
10 compound. 

Furthermore, the test compound may be used in 
preparation, i.e. manufacture or formulation, of a 
composition such as a medicament, pharmaceutical 
composition or drug. This may be administered to 
individuals. A method of the. present invention may thus 
comprise formulating said compound with a 
pharmaceutically acceptable excipient as described 
herein. 

In various aspects, the present invention thus provides a 
modulator identified and/or obtained by a screening 

r 

method of the invention, e.g. a substance which inhibits 
or reduces, increases or potentiates the histone H3 • 
methylase activity of a SET polypeptide. 

Following identification of a modulator, the substance 
may be purified and/or investigated further and/or 
manufactured. A modulator may be used to obtain peptidyl 
30 or non-peptidyl mimetics, e.g. by methods well known to 
those skilled in the art and discussed herein. It may be 
used in a therapeutic context as discussed below. 
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Agents according to the present invention useful in 
modulating the methylation of hi stone H3 and therefore 
one or more of its functions within the chromatin, may 
modulate the methylase activity of the SET polypeptide. 
5 Such agents may specifically inhibit the ability of the 
SET polypeptide to methylate the appropriate residue of 
histone H3 or provide the appropriate level of 
methylation. Assays and screens for such agents are 
provided in accordance with the present invention, '"along 
10 with the agents themselves and their use in modulating 
the methylation and thereby the function of histone H3 . 

An agent able to inhibit methylation of histone H3 by a 
SET polypeptide may include a substance able to affect 

15 the catalytic properties of the enzymatically active site 
of the methylase. An inhibitor of methylation may 
interact with the SET polypeptide within the SET 
methylase domain. Residues within this domain are 
involved in interaction with histone H3 and catalysis of 

20 the methylation. Residues outside of the domain may also 
be involved in interacting with histone H3 and agents 
which interfere with such interaction may also affect the 
methylation as discussed elsewhere herein. 

25 Agents useful in accordance with the present invention 
may be identified by screening techniques which involve 
determining whether an agent under test inhibits or 
disrupts the methylation by a SET polypeptide of the 
histone H3 polypeptide. 

30 

One class of putative modulator compounds can be derived 
from the SET polypeptide sequence and/or a ligand with 
which. it interacts, such as histone H3 . Peptide fragments 
of these polypeptides or alleles, mutants or derivatives 
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of such fragments are described herein. Nucleic acid 
encoding such peptides, vectors and. host cells containing 
such nucleic acid, and methods of expressing nucleic acid 
encoding such peptides are further aspects of the present 
5 invention. 

Where the histone H3 polypeptide is a fragment of the 
full length protein, suitable fragments are those which 
contain the appropriate residue which is methylated by a 

10 SET polypeptide in the full-length polypeptide, and are 
themselves capable of - being methylated by a SET - 
polypeptide. Smaller fragments, and analogues and 
variants of these fragment may similarly be employed, 
e.g. as identified using techniques such as deletion 

15 analysis or alanine scanning. 

Fragments methylated by a SET1 polypeptide include 
fragments comprising the N terminal of histone H3, in 
particular lysine 4. Fragments methylated by a SET2 

20 polypeptide include fragments comprising a lysine residue 
selected from the group consisting of lysine 27, lysine 
36, lysine 37, lysine 42, lysine 56, lysine 64, lysine 
79, lysine 115, lysine 121, lysine 122 and lysine 125 
i.e. a core lysine residue of histone H3 located outside 

25 the first 26 N terminal amino acids, in particular lysine 

Thus, the present invention provides a SET polypeptide, 
which is preferably a peptide fragment, which is able to 
30 inhibit methylation of histone H3 . 

Such peptide fragments may be obtained by means of 
deletion analysis and/or alanine scanning of the relevant 
protein - making an appropriate mutation in sequence, 
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bringing together a SET polypeptide and a histone H3 
polypeptide in the presence of the mutated fragment, and 
determining methylation of the histone H3 polypeptide. 
In preferred embodiments, the peptide is short, as 
5 discussed below, and may be a minimal portion which is 
able to interact with the relevant counterpart protein 
and inhibit methylation. 

. A "fragment" of a polypeptide generally means a stretch 
10 of amino acid residues of at least about five contiguous 
amino acids, often at least about seven contiguous amino 
acids, typically at least about nine contiguous amino 
acids, more preferably at least about 13 contiguous amino 
acids, and, more preferably, at least about 20 to 3 0 or 
15 more contiguous amino acids. Fragments of a SET 
polypeptide may include antigenic determinants or 
epitopes useful for raising antibodies to a portion of 
the amino acid sequence . Alanine scans are commonly used 
to find and refine peptide motifs within polypeptides, 
20 this involving the systematic replacement of each residue . 
in turn with the amino acid alanine, followed by an 
assessment of biological activity. 

Where the SET polypeptide is a SET1 polypeptide and the 
25 histone H3 polypeptide is a fragment, the fragment 
generally comprises the lysine 4 residue. 

Where the SET polypeptide is a SET2 polypeptide and the 
histone H3 polypeptide is a fragment, the fragment 
30 generally comprises a lysine residue selected from the 

group consisting of selected from the group consisting of 
lysine 27, lysine 36, lysine 37, lysine 42, lysine 56, 
lysine 64, lysine 79, lysine 115, lysine 121, lysine 122 
and lysine 125 i..e. a core lysine residue of histone H3 
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located outside the first 26 N terminal amino acids. More 
preferably the fragment comprises lysine 36. 



Peptides in accordance with the present invention tend to 
5 be short, and may be about 4 0 amino acids in length or 
less, preferably about 35 amino acids in length or less, 
more preferably about 3 0 amino acids in length, or less, 
more preferably about 25 amino acids or less, more 
preferably about 20 amino acids or less, more preferably 

10 about 15 amino acids or less, more preferably about 10 
amino acids. or less, or 9, 8, 7, 6, 5 or less in length. 
Peptides according to the present invention may be about 
10-40 amino acids in length, about 5-10, about 10-15, 
about 10-20, about 10-30, about 20-30, or about 30-40 

15 amino acids in length. Peptides which are histone H3 

fragments generally include one or more of the relevant 
lysine residues. 

The present invention also encompasses peptides which are 
20 sequence variants or derivatives of a wild-type SET 

polypeptide sequence. Peptides which are variants of 
wild-type SET polypeptide sequence retain the ability to 
modulate methylation of histone H3 by a SET polypeptide. 

25 A SET polypeptide or histone H3 polypeptide may be a 

peptide or polypeptide which may include an amino acid 
sequence which differs by one or more amino acid residues 
from the wild-type amino acid sequence, by one or more of 
addition, insertion, deletion and substitution of one or 

30 more amino acids. Thus, variants, derivatives, alleles, 
mutants and homologues, e.g. from other organisms, are 
included. 
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Preferably, the amino acid sequence shares homology with 
a region of the relevant SET polypeptide or histone 
H3 polypeptide as referenced herein, preferably at least 
about 60%, or 70%, or 75%, or 80%, or 85%, 90% or 95% 
5 homology. Thus, a peptide fragment of a SET polypeptide 
or histone H3 polypeptide may include 1, 2, 3, 4, 5, 
greater than 5, or greater than 10 amino acid alterations 
such as substitutions with respect to the wild-type 
sequence. 

A derivative of a peptide for which the specific sequence 
is disclosed herein may be in certain embodiments the 
same length or shorter than the specific peptide. In 
other embodiments the peptide sequence or a variant 
thereof may be included in a larger peptide, as discussed 
above, which may or may not include an additional portion 
of SET polypeptide or histone H3 polypeptide. 1, 2, 3, 4 
or 5 or more additional amino acids, adjacent to the 
relevant specific peptide fragment of SET polypeptide or 
histone H3 polypeptide, or heterologous thereto may be 
included at one end or both ends of the peptide. 

Antibodies directed to the site of binding of SET 
polypeptides form another class of putative modulators of 
SET methylase activity. Candidate inhibitor antibodies 
may be characterised and their binding regions determined 
to provide single chain antibodies and fragments thereof 
which are responsible for disrupting the binding. 

Antibodies may be obtained using techniques which are 
standard in the art. Methods of producing antibodies 
include immunising a mammal (e.g. mouse, rat, rabbit, 
horse, goat, sheep or monkey) with a SET polypeptide. 
Antibodies may be obtained from immunised animals using 
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any of a variety of techniques known in the art, and 
screened, preferably using binding of antibody to antigen 
of interest. For instance, Western blotting techniques 
or immunoprecipitation may be used (Armitage et al., 
5 1992, Nature 357: 80-82). Isolation of antibodies and/or 
antibody-producing cells from an animal may be 
accompanied by a step of sacrificing the animal . 

As an alternative or supplement to immunising a. mammal 
10 with a peptide, an antibody specific for a SET 

polypeptide may be obtained from a recombinant ly produced 
library of expressed immunoglobulin variable domains, 
e.g. using lambda bacteriophage or filamentous 
bacteriophage which display • functional immunoglobulin 
15 binding domains on their surfaces; for instance see 
WO92/01047. The library may be naive, that is 
constructed from sequences obtained from an organism 
which has not been immunised with any of the proteins (or 
fragments) , or may be one constructed using sequences 
20 obtained from an organism which has been exposed to the 
antigen of interest . 

Antibodies according to the present invention may be 
modified in a number of ways. Indeed the term "antibody" 

25 should be construed as covering any binding substance 

having an immunoglobulin binding domain with the required 
specificity. Thus the invention covers antibody 
fragments, derivatives, functional equivalents and 
homologues of antibodies, including synthetic molecules 

30 and molecules whose shape mimics that of an antibody 
enabling it to bind an antigen or epitope. 

A hybridoma producing a monoclonal antibody according to 
. the present invention may be subject to genetic mutation 
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or other changes. It will further be understood by those 
skilled in the art that a monoclonal antibody can be 
subjected to the techniques of recombinant DNA technology 
to produce other antibodies or chimeric molecules which 
5. retain the specificity of the original antibody. Such 
techniques may involve introducing DNA encoding the 
immunoglobulin variable region, or the complementarity 
determining regions (CDRs) , of an antibody to the 
constant regions, or constant regions plus framework 
10 regions, of a "different immunoglobulin. See, for 
instance, EP184187A, GB 2188638A or EP-A-0239400 . 
Cloning and expression of chimeric antibodies are 
described in EP-A-0120694 and EP-A-0125023 . 

15 Hybridomas capable of producing antibody with desired 
binding characteristics are within the scope of the 
present invention, as are host cells, eukaryotic or 
prokaryotic, containing nucleic acid encoding antibodies 
(including antibody fragments) and capable of their 

20 expression. The invention also provides methods of 

production of the antibodies including growing a cell 
capable of producing the antibody under conditions in 
which the antibody is produced, and preferably secreted. 

25 The reactivities of antibodies on a sample may be 
determined by any appropriate means . Tagging with 
individual reporter molecules is one possibility. The 
reporter molecules may directly or indirectly generate 
detectable, and preferably measurable, signals. The 

30 linkage of reporter molecules may be directly or 

indirectly, covalently, e.g. via a peptide bond or non- 
covalently. Linkage via a peptide bond may be as a 
result of recombinant expression of a gene fusion 
encoding antibody and reporter molecule. The mode of 
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determining binding is not a feature of the present 
invention and those skilled in the art are able to choose 
a suitable mode according to their preference and general 
knowledge . 

5 

Antibodies may also be used in purifying and/or isolating 
SET polypeptide, for instance following production of 
the polypeptide by expression from encoding nucleic acid. 
Antibodies may be useful in a therapeutic context (which 

10 may include prophylaxis) to disrupt binding of a SET 

polypeptide to histone H3 and inhibit the methylation of 
the target lysine residue. Antibodies can for instance 
be micro- injected into cells, e.g. at a tumour site, 
subject to radio- and/or chemo- therapy (as discussed 

15 already above) . Antibodies may be employed in accordance 
with the present invention for other therapeutic and non- 
therapeutic purposes which are discussed 
elsewhere herein. 

20 As noted, the agent may be peptidyl, e.g. a peptide which 
includes a sequence as recited above, or may be a 
functional analogue of such a peptide. 

As used herein, the expression "functional analogue" 
25 relates to peptide variants or organic compounds having 
the same functional activity as the peptide in question, 
which may interfere with the methylation of the lysine 4 
residue of histone H3 by a SET polypeptide. Examples of- 
such analogues include chemical compounds which are 
30 modelled to resemble the three dimensional structure of 
the SET methylase domain in the contact area, and in 
particular the arrangement of the key amino acid 
residues. 
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In a further aspect, the present invention provides the 
use of a SET polypeptide, in particular a peptide 
fragment, which is capable of methylating the lysine 4 
residue of histone H3 , in a method of designing a peptide 
5 or non-peptidyl mimetic, which mimetic is able to 
interact with the SET polypeptide active site and 
modulate the methylation of the lysine 4 residue by the 
SET polypeptide. 

10 Accordingly, the present invention provides a method of 
designing a mimetic, for example of a histone. H3 amino 
terminal fragment, which has the biological activity of 
modulating the methylation of the lysine 4 residue by the 
SET polypeptide, said method comprising: 

15 (i) analysing a substance to determine the amino 

acid residues essential and important for the biological 
activity to define a pharmacophore; and, 

(ii) modelling the pharmacophore to design and/or 
screen candidate mimetics which modulate the methylation 

20 as described. 

Suitable modelling techniques are known in the art. This 
includes the study of the bonding between SET and histone 
H3 and the design of compounds which contain 
25 corresponding functional groups arranged in such a manner 
that they could reproduce that bonding. 

The designing of mimetics to a known pharmaceutical ly 
active compound is a known approach to the development of. 
30 pharmaceuticals based on a "lead" compound. This might 
be desirable where the active compound is difficult or 
expensive to synthesise or where it is unsuitable for a 
particular method of administration, for instance SET 
polypeptides may not be well suited as active agents for 
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oral compositions as they tend to be quickly degraded by 
proteases in the alimentary canal. 



There are several steps commonly taken in the design of a 
5 mimetic from a compound having a given target property. 
Firstly, the particular parts of the compound that are 
critical . and/or important in determining the target 
property are determined. In the case of a peptide/ this 
can be done by systematically varying the amino acid 
10 residues in the peptide, e.g. by substituting each 

residue in turn. These parts or residues constituting 
the active region of the compound are known as its 
"pharmacophore" . 

15 Once the pharmacophore has been found, its structure is 
modelled according to its physical properties, e.g. 
stereochemistry, bonding, size and/or charge, using data 
from a range of sources, e.g. spectroscopic techniques, 
X-ray diffraction data and NMR. Computational analysis, 

20 similarity mapping (which models the charge and/or volume 
of a pharmacophore, rather than the bonding between 
atoms) and other techniques can be used in this modelling 
process . 

25 In a variant of the above approach, the three-dimensional 
structure of a ligand and its binding partner are 
modelled. This can be especially useful where the ligand 
and/or binding partner change conformation on binding, 
allowing the model- to take account of this the design of 

30 the mimetic. 



A compound found to have the ability to affect SET 
methylase activity has therapeutic and other potential in 
a number of contexts, as discussed. For therapeutic 
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treatment such a compound may be used in combination with 
any other active substance, e.g. for ant i- tumour therapy 
another ant i- tumour compound or therapy, such as 
radiotherapy or chemotherapy. 

5 

An agent identified using one or more primary screens 
(e.g. in a cell-free system) as having ability to 
.modulate the methylase activity of a SET polypeptide may 
be assessed further using one or more secondary screens. 

10 A secondary screen may involve testing for a biological 
function of histone H3 methylated at the lysine 4 ■ 
position or a lysine residue selected from the group 
consisting of lysine 27, lysine 36, lysine 37, lysine 42, 
lysine 56, lysine 64, lysine 79, lysine 115, lysine 121, 

15 lysine 122 and lysine 125 i.e. a core lysine residue of 
histone H3 located outside the first 26 N terminal amino 
acids, preferably lysine 36. Suitable biological 
functions include the transcriptional activation of a 
chromatin region or domain which comprises or is 

20 associated with the histone H3 polypeptide and/or binding 
to a regulatory factor such as the NuRD repressor. 

Generally, a modulator according to the present invention 
is provided in an isolated and/or purified form, i.e. 

25 substantially pure. This may include being in a 

composition where it represents at least about 90% active 
ingredient, more preferably at least about 95%, more 
preferably at least about 98%. Such a .composition may, 
however, include inert carrier materials or other 

30 pharmaceutical^ and physiologically acceptable 

excipients. As noted below, a composition according to 
the present invention may include in addition to an 
modulator compound as disclosed, one or more other 
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molecules of therapeutic use, such as an ant i- tumour 
agent . 

The invention further provides a method of treatment which 
5 includes administering to a patient an agent as described 
herein which interferes with or inhibits the methylation 
of the target residue of hi stone H3 (for example lys 4 or 
a core lys residue) by a SET polypeptide. Exemplary 
purposes of such treatment are discussed elsewhere herein. 

10 

The invention further provides various therapeutic. methods 
and uses of one or more compounds or substances selected 
. from (i) SET polypeptide which is able to bind to histone 
H3; (ii) a modulator identified by a screening method of 
15 the present invention; (iii) a mimetic of any of the above 
substances which can bind to histone H3 or a SET 
polypeptide . 

The therapeutic/prophylactic purpose of such a method or 
20 use may be the modulation, e.g. disruption or 

interference, of the methylation of histone H3 at 
residues other than lysine 9. Target residues include 
lysine 4* for SET1. For a SET2 polypeptide, this may 
include a lysine residue selected from the group 
25 consisting of lysine 27, lysine 36, lysine 37, lysine 42, 
lysine 56, lysine 64, lysine 79, lysine 115, lysine 121, 
lysine 122 and lysine 125 i.e. a core lysine residue of 
histone H3 located outside the first 26 N terminal amino 
acids, for example, within the central and C terminal 
30 regions, preferably lysine 3 6 • 



The therapeutic/prophylactic purpose may be: 
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(i) Cancer treatment, which may for example be in 
combination with chemotherapy and/or radiotherapy. 

(ii) Cancer prophylaxis, 

5 (iii) Treatment of other proliferative disorders 

described herein e.g. psoriasis, cataracts, multiple 
myeloma . 

In various further aspects, the present invention thus 

10 provides a pharmaceutical composition, medicament, drug 
or other composition for .such a purpose, the composition 
comprising one or more such compounds or substances which 
modulate the activity of a SET polypeptide as described 
herein and/or the methylation of histone H3 at a lysine 

15 residue other than Lys9, the use of such* a substance in a 
method of medical treatment, a method comprising 
administration of such a substance to a patient, e.g. for 
treatment (which may include preventative treatment) of a 
medical condition, e.g. a condition associated with a 

20 defect or disorder in transcriptional control, DNA 

replication, or cell cycle control, e.g. for treatment of 
a disorder of cellular proliferation such as restenosis, 
psoriasis, cataracts, multiple myeloma and cancer, 
preferably all types of solid cancers and malignant 

25 lymphomas and especially leukaemia, skin cancer, bladder 
cancer, breast cancer, uterus cancer, ovary cancer, 
prostate cancer, lung cancer, colon cancer, pancreas 
cancer, renal cancer, stomach cancer and cerebral cancer, 
use of such a substance in the manufacture of a 

30 composition, medicament or drug for administration for 
such a purpose, e.g. for treatment of a proliferative 
disorder, and a method of making a pharmaceutical 
composition comprising admixing such a substance with a 
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pharmaceutically acceptable excipient, vehicle or 
carrier, and optionally other ingredients. 

The substances may. be used as sole active agents or in 
5 combination with one another or with any other active 
substance, e.g. for anti-tumour therapy another anti- 
tumour compound or therapy, such as radiotherapy or 
chemotherapy . 

Whatever the substance used in a method of medical 
treatment of the present invention, administration is 
preferably in a "prophylactically effective amount" or a 
"therapeutically effective amount" (as the case may be, 
although prophylaxis may be considered therapy) , this * 
being sufficient to show benefit to the individual. The 
actual amount administered, and rate and time-course of 
administration, will depend on the nature and severity of 
what is being treated. Prescription of treatment, e.g. 
decisions on dosage etc, is within the responsibility of 
general practitioners and other medical doctors. 

A substance or composition may be administered alone or 
in combination with other treatments, either 
simultaneously or sequentially dependent upon the 
25 condition to be treated, e.g. cancer. 

Pharmaceutical compositions according to the present 
invention, and for use in accordance with the present 
invention, may include, in addition to active ingredient, 
30 a pharmaceutical^ acceptable excipient, carrier, buffer, 
stabiliser or other materials well known to those skilled 
in the art. Such materials should be non-toxic and 
should not interfere with the efficacy of the active 
ingredient. The precise nature of the carrier or other 



10 



15 



20 




WO 02/090578 



PCT/GB02/02050 



36 



material will depend on the route of administration, 
which may be oral, or by injection, e.g. cutaneous, 
subcutaneous or intravenous. 

5 Pharmaceutical compositions for oral administration may 
be in tablet, capsule, powder or liquid form. A tablet 
may include a solid carrier such as gelatin or an 
adjuvant. Liquid pharmaceutical compositions generally 
include a liquid carrier such as water, petroleum, animal 
10 or vegetable oils, mineral oil or synthetic oil. 
Physiological saline solution, dextrose or other 
saccharide solution or glycols such as ethylene glycol, 
propylene glycol or polyethylene glycol may be included. 

15 For intravenous, cutaneous or subcutaneous injection, or 
injection at the site of affliction, the active 
ingredient will be in the form of a parenterally 
acceptable aqueous solution which is pyrogen-free and has 
suitable pH, isotonicity and stability. Those of 

20 relevant skill in the art are well able to prepare 

suitable solutions using, for example, isotonic vehicles 
such as Sodium Chloride Injection, Ringer's Injection, 
Lactated Ringer's Injection. Preservatives, stabilisers, 
buffers, antioxidants and/or other additives may be 

25 included, as required. 

Liposomes, particularly cat ionic liposomes, may be used 
in carrier formulations. 



30 Examples of techniques arid protocols mentioned above can 
be found in Remington's Pharmaceutical Sciences, 16th 
edition, Osol , A. (ed) , 1980. 
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The substance or composition may be administered in a 
localised manner to a tumour site or other desired site 
or may be delivered in a manner in which it targets 
tumour or other cells. 

5 

Targeting therapies may be used to deliver the active 
substance more specifically to certain types of cell, by 
•the use of targeting systems such as antibody or cell 
specific ligands. Targeting may be desirable for a 
10 variety of reasons, for example if the agent is 

unacceptably toxic, or if it would otherwise require too 
high a dosage, or if it would not .otherwise be able to 
enter the target cells. 

15 Instead of administering such substances directly, they 
may be produced in the target cells by expression from an 
encoding nucleic acid introduced into the cells, e.g. 
from a viral vector. The vector may be targeted to the 
specific cells to be treated, or it may contain 

20 regulatory elements which are switched on more or less 
selectively by the target cells. 

Nucleic acid encoding the substance e.g. a peptide able 
to modulate, e.g. interfere . with, the methylation of the 
25 lysine 4 or core lysine residue of histone H3 by a SET 

polypeptide, may thus be used in methods of gene therapy, 
for instance in treatment of individuals, e.g. with the 
aim of preventing or curing {wholly or partially) a 
disorder. 



30 



Vectors such as viral vectors have been used in the prior 
art to introduce nucleic acid into a wide variety of 
different target cells. Typically the vectors are 
exposed to the target cells so that transfection can take 
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place in a sufficient proportion of the cells to provide 
a useful therapeutic or prophylactic effect from the 
expression of the desired peptide. The transfected 
nucleic acid may be permanently incorporated into the 
genome of each of the targeted cells, providing long 
lasting effect, or alternatively the treatment may have 
to be repeated periodically. 

A variety of vectors, both viral vectors and plasmid 
vectors, are known in the art, see US Patent No. 
5,252,479 and WO 93/07282. In particular, a number of 
viruses have been used as gene transfer vectors, 
including papovaviruses ; such as SV40, vaccinia virus, 
herpesviruses, including HSV and EBV, .and retroviruses. 
Many gene therapy protocols in the prior art have used 
disabled murine retroviruses. 

As an alternative to the use of viral vectors in gene 
therapy other known methods of introducing nucleic acid 
into cells includes mechanical techniques such as 
microinjection, transfer mediated by liposomes and 
receptor-mediated DNA transfer. 

Receptor-mediated gene transfer, in which the nucleic 
acid is linked to a protein ligand via polylysine, with 
the ligand being specific for a receptor present on the 
surface of the target cells, is an example of a technique 
for specifically targeting nucleic acid to particular 



A peptide or other substance having an ability to 
modulate or interfere with the methylation of the lysine 
4 or core lysine residue of his tone H3 by a SET 
polypeptide, a nucleic acid molecule which encodes a 
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peptide having that ability, may be provided in a kit, 
e.g. sealed in a suitable container which protects its 
contents from the external environment . Such a kit may 
include instructions for use. 

In still further aspects the present invention provides 
for the purification of a SET polypeptide, or for the 
purification of a histone H3 polypeptide. The invention 
also provides for a purified SET polypeptide protein. 
The purified polypeptide may be about 10% pure, more 
preferably about 20% pure, more preferably about 30% 
pure, more preferably about 40% pure, more preferably 
about 50% pure, more preferably about 6 0% pure, more 
preferably about 70% pure, more preferably about 80%. 
pure, more preferably about 90% pure, more preferably 
about 95% pure, or substantially pure. 

In another aspect the present invention provides a method 
of purifying a SET polypeptide, the method including 
contacting the SET polypeptide with histone H3 
polypeptide. 

A mixture of material including SET polypeptide may be 
contacted against immobilised histone H3 polypeptide 
(e.g. immobilised either covalently or non-covalently 
such as via a specific binding molecule such as 
streptavidin or biotin) and molecules which do not bind 
to the histone H3 polypeptide are washed off. 

Following purification, the SET polypeptide may be used 
as desired, e.g. in an assay for an agent which modulates 
its activity, e.g. binding, in raising or obtaining a 
specific antibody or other binding molecule, or in a 
therapeutic context. 
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Other aspects of the present invention relate to the 
disruption of the interaction between histone H3 and the 
NuRD complex by the methylation of histone H3 lysine 4 by 
5 a SET1 polypeptide. 

A method for identifying and/or obtaining an agent which 
modulates the interaction of histone H3 with a NuRD 
repressor, may include: 
10 (a) bringing into contact one or more components of the 
NuRD repressor complex and a histone H3 polypeptide in 
the presence of a test compound; and, 
(b) determining interaction between the histone H3 
polypeptide and the said one. or more components. 



The one or more components of the NuRD . repressor complex 
and the histone H3 polypeptide may be brought together in 
the presence of a SET1 polypeptide. A reduction in the 
activity of the SET1 polypeptide in methylating the lys 4 
20 residue of the histone H3 polypeptide may be determined 
by an increase in interaction and/or binding between the 
NuRD complex and the histone H3 polypeptide in the 
presence relative to the absence of the test compound. 

25 Components of the NuRD complex may include HDAC1, HDAC2, 
Mi -2 (3, Rbap4 8, Rbap4 6, MTA1, MTA2 and MBD3 (Zhang et al 
(1998) Cell 95(2)279-289). Histone H3 polypeptides are 
described above. 

30 Interaction and/or binding may be determined by any 
convenient technique as described above. A agent 
identified and/or obtained using such a method may be 
useful in the modulation of transcriptional activation. 
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Other aspects of the present invention relate to the tri- 
methylation of histone H3 lysine 4 by a SET1 polypeptide 
and the effect of lys 4 tri -methylation on 
transcriptional activation . 

A method of identifying transcriptionally active 
chromatin may comprise; 

determining the presence or absence of a 
trimethylated Lys 4 residue in a histone H3 polypeptide 
of said chromatin. 

The presence of a trimethylated Lys4 residue in a histone 
H3 polypeptide of said chromatin is indicative that said 
chromatin is transcriptionally active. 

The presence or absence of a trimethylated Lys 4 may be 
determined by. any convenient means. For example, said 
chromatin region may be contacted with an antibody which 
binds specifically to a histone H3 polypeptide which is 
trimethylated at Lys 4 . 

Various further aspects and embodiments of the present 
invention will be apparent to those skilled in the art in 
view of the present disclosure. 

Certain aspects and embodiments of the invention will now 
be illustrated by way of example and with reference to 
the figure described below. 

Figure 1 shows sequence alignments within the SET domains 
and flanking sequences of yeast proteins containing SET 
domains, human homlogues of SET1 and the human histone H3 
lysine 9 methylase Suv39hl. hSetl.l, human KiAA0339; 
hSetl.2, human KIAA1076; sp Setl, S.powbe SPCC306.04C; sc 
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Setl, S . cerevisiae Setlp; sc Set2, S . cerevisiae Set2p; 
Suv39Hl, human Suvar 39H1; sc Set3, S . cerevisiae YJL105W; 
sc Set4 / S . cerevisiae YKR029C; sc Sets, S . cerevisiae 
YHR207C; sc Set6, S . cerevisiae YPL165C. Alignments were 
5 calculated as follows: Clustal Wl . 8 (DNA-Protein) -Global 
progressive (BCM) > BOXADE 3.21, Consensus line: No 
consensus, Fraction of sequences (that must agree for 
shading) : 0 . 5 . 

10 Figure 2 shows sequence alignments within the SET domains 
and flanking sequences of SET1 and SET2 polypeptides. 
hSetl.l, human KiAA0339; hSetl.2, human KIAA1076; sc 
Sett, S . cerevisiae Setlp; spSet2, S. pombe SPAC29B12; sc 
Set2, S . cerevisiae Set2p; hSet2.1, human K1AA1732; 

15 hSet2.2 / human HSPC069 ; hSet2.3, human AJ2384 03; WHSC17 
human WHS CI . 

Figure 3 shows the" mapping of the methylation site of the 
S. cerevisiae SET1 in the histone H3 amino tail. 

20 

Figure 4 shows a relationship tree between proteins which 
contain SET domains. The tree was compiled using the 
following multiple alignment parameters; program - 
ClustalW (vi.4), similarity matrix - blosum 7 Open Gap 
25 penalty - 10. 0, Extend Gap penalty - 0.1, Delay Divergent 
- 40%, Gap distance -.8. 

Figure 5 shows a schematic diagram of various methylated 
and unmethylated histone H3 peptides used to purify a 
30 specific set of binding proteins. The C-terminus of each 
of the peptides was followed by GGC. 

Figure 6 shows the deacetylase activity and released 
tritiated acetate, as measured by scintillation counting 
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(c pm = counts per minute) from proteins bound to the 
unmethylated and methylated hi stone H3 peptide in pull 
down assays. 

5 Figure 7 shows a list of genes activated by SET1 . To 
identify targets of SET1 mediated expression, 
transcription profiles of setlA yeast were determined. 
Potential targets were collated from genes down regulated 
in the mutant and assigned relative merit (P-value) using 
10 a gene-specific error model. Secondary effects were 

eliminated by disregarding genes also down regulated in 
general stress or in a sir2A profile (sir2A and sirlA 
yeast both exhibit downstream effects secondary to 
derepression of the silent mating loci) . 

15 

Table 1 shows % identity within the SET Domain among the 
SET1 and SET2 Families. The table was compiled using the 
following parameters; program - MacVector 6.5, matrix - 
BLOSUM 30, Clustal alignment - pairwise alignment, Open 
20 Gap penalty - 1.0, Extend Gap penalty - 0.1. 

Table 2 shows a summary of the K4 methylation state at 
the promoter of constitutive genes, inositol regulated 
genes and methionine regulated genes. means no 

25 methylation detected. w Di" means di -methylation detected 
at the promoter and *tri" means tri -methylation at the 
promoter. The- growth conditions for activation and 
repression of. the genes are described below. 

30 EXPERIMENTAL 

Materials and Methods 
Purification of yeast SET Protein 

Yeast Setlp and Set6p were tagged at the N-terminus with 
a Protein A epitope from S. aureus. These proteins were 
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expressed in the wild type strain RS453 (mat a, ade2 , 
ura3, leu2, his3, trpl) and in the C 13 ABYS-86 strain 
(Matct ura3 leu2-3 his3-112 pral-1 prbl-1 prcl-1 cpsl-3) . 
The tagged proteins with the expected size (PtA-Setlp, 
5 160Kda; PtA-Set6p, 55Kda) were detected in total extract 
by western blot with Anti Protein A antibody (PAP, 
rabbit, DAKO- Z0113) 1:3000 dilution. PtA-Setlp and PtA- 
Set6p were purified by affinity to an IgG Sepharose 
column as described (Siniossoglou, S. et al (199'6) Cell 
10 84, 265-275). For each set of assays, 5g of spheroplasts 
were homogenised in 50ml of lysis buffer. The soluble 
supernatant (S) was passed through a column containing 
2 00 ]il of IgG beads. 

15 Avoiding the elut ion step, the beads were washed with 25 
ml of equilibration buffer. Then, 60 ul of beads were 
transferred to 3 eppendorf tubes, placed on ice, for the 
enzymatic reaction. The remaining 20 ul beads were re- 
suspended in 50pl Laemli buffer, vortex and boiled for 3 

20 min. Samples of "soluble supernatant, S" (3 0]il , 

equivalent to 0.06% of the total soluble supernatant) and 
"eluted protein, E" (25)11, equivalent to 5% of the total 
purified protein) were analysed by western blot with Anti 
Protein A antibody (PAP, rabbit, DAKO- Z0113) 1:3000 
25 dilution. 

Equilibration Buffer: 20mM Tris-HCl pH 8.5; 0.05 mM DTT. 

30 Methyltransf erase Assay 

The enzymatic assays were performed on- a mixture of 
soluble histones from calf thymus (H2A, H2B, H3 and H4 ) , 
Sigma; recombinant j»cenopus H3 or H3 deleted for the first 
26 amino acids {Atail H3) . 
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For each reaction, 60]ig of calf histories or 20ug of 
recombinant H3 were used. The following mixtures were 
prepared: 

5 1) 60 ul of PtA-Set p beads (either Setlp, Set2p or 
Set6p) + 30 \il of methylation buffer + 2 pi of 14 C-SAM 
(NEC-363 adenosyl-L-methionine, S- (methyl- 14 C) - 

2) 60 ul of PtA-Set p beads (either Setlp, Set2p or 
10 Set6p) + 30 ul of methylation buffer* 3 ul of soluble 

histones (60 ug) + 2 ul of 14C-SAM (NEC-363 adenosyl-L- 
methionine, S- (methyl- 14 C) . 

3) 60 ul of PtA-Set p beads (either Setlp, Set2p or 

15 Set6p) + 30 ul of methylation buffer + 4ul of recombinant 
xH3 (20ug)+ 2ul of 14C-SAM (NEC-363 adenosyl -Lmethionine , 
S- (methyl- 14C) . 

4) 60 ul of IgG sepharose beads (previously equilibrated 
20 in methylation buffer) + 30 ul of methylation buffer + 3 

Ul of soluble histones (60 ug) + 2 ul of 14C-SAM (NEC-363 
adenosyl-L-methionine, S- (methyl-14C) . 

5) 60 ul of IgG sepharose beads (previously equilibrated 
25 in methylation buffer) + 30 ul of methylation buffer + 

4ul of recombinant xH3 (20ug) + 2 ul of 14C-SAM (NEC-363 
adenosyl-L-methionine, S- (methyl -14C) . 



Methylation Buffer: 50mM Tris-HCl pH 8.5; 20mM K Cl, lOmM 
30 MgC12, lOmM -mercaptoethanol , 0.05mM DTT , 250mM sucrose, 
0.2% dodecyl-p-D-maltoside. 

When assayed, the recombinant xenqpus H3 deleted for the 
26 first amino acids (xgH3) was used at 10ug/reaction. 
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The microfuge tubes were incubated at 3 0°C during 2 hours 
with shaking. After that, 15 pi of 4x (protein sample 
buffer) were added to each reaction (the total liquid 
vol. of each reaction should be around 45ul) . The samples 
were boiled for 3 minutes. The histones were separated in 
a 1.5 trim, 20%' acrylamide gel, run at 200 V during 1 hour 
and 15 minutes and transferred to nitro-cellulose 
membranes by standard procedures. The nitro-cellulose 
membrane was exposed to Kodak Biomax MS film with Biomax 
Transcreen-LE intensifying screen at -80C during 36-48h. 

Mapping of Site of Methylation within Histone H3 
Histone H3 was methylated by SET1 as described herein 
with a 14 "C-methyl group. The methylated residue was then 
identified using N terminal sequence analysis as 
described in Rein D. and Speicher D. Current Protocols in 
Protein Science (1997) 11.10.01-11.10.38. 

Briefly, A stained protein band was excised from a PVDF 
membrane, sonicated in a both sonicator and sequenced 
using a Hewlett-Packard Model G 10 0SA sequencing system. 
Sequencing was carried out according to the. 
manufacturer's instructions and PTH (phenylthiohydantoin) 
amino acids from each sequencer cycle were separated by 
in-line HPLC to provide sequence data. 

Expression Profiling 

The setlA mutant and its parent strain UCC1001 were grown 
in parallel in YPD media at 3 0°C to an OD 600 of 1.0. Poly 
(A) RNA was isolated and reverse transcribed 
incorporating amino- ally 1 dUTP. The resulting DNA probe 
was labeled with reactive Cy5 (mutant) or Cy3 (parent 
strain) dye and hybridized to a spotted cDNA microarray 
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containing the yeast open reading frames as described in 
Carroll et al (2001) PNAS 98 12578-12583 and DeRisi et al 
(1997) Science 278 680-686. Microarrays were analyzed 
using a GenePix4000A scanner and GENEPIX 3.0 software. 
Candidate SETI regulated genes were identified from ■ 
duplicate experiments using a gene specific error model 
described previously (Hughes et al Cell (2000) 102 109- 
126). 

Northern blots and ChlPs 

Northern blots arid ChiPs were done as described in Kent 
et al Genes Dev (2001) 15 619-626, except that for ChIP 
5|il of Tri-meK4 H3 antibody and 2pl of Di-me K4 H3 
antibody were used and protein A-sepharose beads were 
pre-incubated with 1 . 5pg of sonicated salmon sperm DNA 
for 3 0 minutes. For the ChlPs experiments, the cell were • 
grown o/n in minimal medium lacking inositol (medium - 
ino) , then diluted to OD600: 0.15 in minimal medium - 

i 

inositol or the same one supplemented with lOOmg/L 
inositol (medium +ino) and grown to OD600: 1.2 and then 
processed for chromatin preparation. 

Samples from medium -methionine and +methionine were 
prepared in the same way except that the medium was 
supplemented . 

Affinity purification from Hela nuclear extract 
Histone peptides which were trimethylated at different 
lysine residues, were made using standard techniques. 
The C- terminus of the peptides contained a 2 glycine 
spacer followed by a cysteine. The peptides were 
immobilised onto Sulfolink Coupling gel (Pierce) via the 
C-terminal cysteine, at a concentration of lmg/ml." 
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Hela nuclear extract (Computer Cell Culture Centre, 
Belgium) was diluted in IPH buffer (50mM Tris pH8, 150mlVI 
NaCI, 5mM EDT A. 0.5% NP-40 v/v) to a final protein 
5 concentration of 4mg/ml and pre-cleared.- 

Affinity purifications were done with 5]il of sepharose- 
linked peptide and 150pl of diluted Hela nuclear extract. 
Competitor peptide was added to a final concentration of . 
150pg/ml, where used. The purifications were incubated 
for 90 minutes on a wheel at 4°C and washed 3 times in 
IPH before resolution on an 8% SDS-PAGE gel. For column 
purification of complex, 2 ml of diluted Hela nuclear 
extract was poured twice over a micro-column of 100pl of 
sepharose-linked - unmethylated H3 peptide. The column was 
washed with 4 column volumes of IPH and bound proteins 
were eluted in 250mM NaCI IPH. Eluted fractions 
containing the complex were made up to 150p.l in IPH and 
rebound to 5pl of H3 peptide sepharose, as. above- Bound 
proteins were visualised by silver staining as described 
previously (Morrissey (1981) 117(2) 307-310). 

Mass spectrometry identification of proteins 
Proteins that rebound un-methylated peptide after column 
25 purification were excised from an 8% coomassie stained 
gel with a protein- free razor blade. Proteins were 
digested in-gel with trypsin. After overnight incubation 
at 3 0°C, a 0 . 5pl aliquot from the digest was spotted on 
the MALDI target for fingerprinting. The remaining digest 
30 was purified using C18 Zip-Tips prior ESI -MS. 

Peptide Mass Fingerprints were taken with a MALDI 
instrument from Micromas fitted with Delayed Extraction, 
using Alpha -cyano-cinnamic acid dissolved in 50% 
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Acetonitrile : 0.1% TFA as matrix. CID fragmentation 
spectra from peptides were taken with an Ion-trap 
instrument (LCQ-Decca) from ThermoQuest. Protein databases 
were searched with the mass spectrometry data using the 
programs Mascot, http : //matrixscience . com (Mass 
fingerprinting and fragmentation spectra) and 
Profound, http://129.85. 19. 192 (mass fingerprinting data). 

Deacetylase assays 

Histone peptide affinity purifications from Hela nuclear 
extract were re- suspended in lOOyl of fresh IPH buffer 
containing 150,000 cpm of [ 3 H] -labelled acetylated H4 
peptide. Reactions were incubated at 37 °C for 120 
minutes, with regular mixing. Reactions were stopped by 
the addition of 65yil of acid mix ( ^1 M HCl, 0.16 M acetic 
acid) . 700pl of ethylacetate was added and tubes were 
vortexed vigorously for 20 seconds. Phases were separated 
by centrif ugation at 13500 rpm for 1 min. 500yl of the 
upper phase (ethylacetate) was removed " and mixed with 1ml 
of scintillant (Optiphase™, Wallac) . Release of ( 3 H] - 
acetate from the H4 peptide was measured in counts per 
minute (cpm) using a scintillation counter (Beckraann 
LS6000SC) . 

Western blots, antibodies and immunoprecipi tat ions 
Bound proteins were resolved on 8% SDS-PAGE gel, blotted 
to nitrocellulose and blocked overnight (4% non-fat 
milk/0.5% Tween 20 v/v) . Blots were probed with the 
following antibodies for 1 hour at room temperature: 
anti-Rbap48 (Genetex, llglO, lyig/ndj, anti-DNMTl (NEB- 
231, 1 in 1000), anti-Sin3 (Santa Cruz 994, lpg/ml) and 
anti-p60 (1 in 1000) . Blots were washed in blocking 
buffer, incubated with HRP linked secondary antibodies 
(Abeam) and visualised with ECL (Amersham) . NuRD 
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immunoprecipi tat ions were done by incubating 500pl of 
diluted Hela nuclear extract with anti-MTA2 antibody 
(Santa Cruz 9447: 1 \ig/ml) and- 15 \xl of protein A/G bead 
mix (Amersham) for 5-12 hours at 4°C. 

5 

Precipitations were washed 3 times with IPH before 
resolution on a 20% SDS-PAGE gel. After blotting to 
nitrocellulose and blocking (5%BSA/0.5% Tween 20) for 1 
hour at room temperature , the blots were probed overnight 
10 with anti-MBD3 (Santa Cruz 9402, lpg/ml) , anti-H3 
(500ng/ml) or anti-H4 (Abeam ab7311, 500ng/ml) . 

Blots were washed briefly with BSA blocking buffer, 
incubated with HRP linked secondary antibody and 
15 visualised as above. 

Results 

S. cerevisiae SET1 and SET2 are Histone 
Methyltransf erases (HMTs ) 
20 PtA tagged SETlp, SET2p and SET6p were purified from S. 
. cerevisiae by IgG sepharose chromatography. Western 
blotting revealed a strong band in each of the eluted 
protein fractions corresponding to full-length PtA-SETlp, 
PtA-SET2p and PtA-SET3p, respectively. 

25 

Approximately 10ug of purified PtA-SETlp, PtA-SET2p or 
PtA-SET6p were assayed for methyltransf erase activity on 
60pg of a mixture of calf thymus soluble histones (H2A, 
H2B, H3 and H4) or 20ug of recombinant xenopus histone H3 
30 in the presence of ( 14 C-Me) S-adenosyl methionine. 

Autoradiograms were developed after 48 hours exposure. 



No methylation was observed for PtA-SETlp, PtA-SET2p or 
PtA-SET6p in the absence of calf or recombinant histones. 
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PtA-SETlp and PtA-SET2p were observed to methylate 
recombinant Xenopus histone H3 . When the preparation of 
histones H2A, H2B, H3 and H4 was used, PtA-SETlp and 
5 PtA-SET2p were, observed to specifically methylate histone 
H3 . 

. No methylation activity was observed for PtA-SET6p in the 
presence of histones H2A ; H2B, H3 and H4 or the presence 
10 of recombinant Xenopus histone H3 . 

The Methyltransferase Activity of SET1 is Specific for 
Lysine 4. 

. A truncated recombinant Xenopus H3 histone containing 
15 lacking the first 26 N terminal residues was compared 
with the full length Xenopus H3 histone as a substrate 
for PtA-SETlp and PtA-SET2p. 

For each assay, 20ug of recombinant - Xenopus H3 histone 
20 andlOug of PtA-SETlp were incubated in the presence of 

( 14 C-Me) S-adenosyl methionine. Autoradiograms showing the 
presence of 14 C label were developed after 48 hours. The 
presence of substrates on the gel was confirmed by 
Ponceau staining. 

25 

No methylation was observed in the absence of a histone 
H3 substrate. 

Both PtA-SETlp and PtA-SET2p were observed to methylate 
30 full length Xenopus histone H3 . 
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No methylation by PtA-SETlp of the Xenopus histone H3 
with the N terminal deletion was observed but PtA-SET2p 
was observed to methylate this truncated protein. 
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These results demonstrate that PtA-SETlp methylates 
histone H3 within the N terminal 26 amino acid residues 
of histone H3 whereas PtA-SET2p methylates outside this 
amino tail region, within the core of the histone H3 
polypeptide . 

Mapping of the Methylation Site within the H3 Amino Tail 
Xenopus H3 histone labelled with a 14 C-methyl group by 
PtA-SETlp as described above, was sequenced by sequential 
Edman degradation and fractions corresponding to each 
amino acid cycle were collected and counted by 
scintillation counting. The results are shown in Figure 
3. The N terminal sequence of Xenopus histone H3 is shown 
underneath corresponding fraction. 

These results show that the lysine 4 residue of histone 
H3 is specifically methylated by PtA-SETlp. No 
methylation of lysine 9 is observed". 

SET polypeptides as described herein are shown to 
specifically methylate histone H3 at residues other than 
lysine 9. 

SET1 polypeptides methylate the lysine 4 residue while 
SET2 polypeptides methylate the core of histone H3, 
outside the amino tail. 

The histone H3 N- terminus binds a set of proteins when 
unmethylated at lysine 4 

Differently methylated histone peptides were used in pull 
downs assays from Hela nuclear extract and the bound 
proteins analysed by silver stain. 
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The histone peptides used were trimethylated at lysine 4 
(K4) , lysine 9 (K9) or both (K4+K9) and two control 
peptides of unmethylated H3 and H4 tails (Figure. 5) . 
These peptides were immobilised onto sepharose beads via 
a C-terminal cysteine residue and used to affinity purify 
proteins from Hela nuclear extract. 

Although some proteins were found to specifically bind to 
the methylated peptides, a number of proteins were also 
observed to be specifically purified by the unmethylated 
H3 peptide. 

This set of, proteins was also observed to be purified by 
the K9 methylated H3 peptide, but not by a peptide 
methylated at K4, or K4 and K9, or by an unmethylated H4 
peptide. This indicates that these proteins only bind to 
histone H3 N-termini when lysine 4 is unmethylated and 
that methylation of lysine 9 does not disrupt the 
binding of the complex. 

To confirm the specificity of the proteins binding to 
unmethylated H3 peptide, pull downs were done in the 
presence or ahsence of competitor peptide. The set of 
proteins bound to the unmethylated peptide was observed 
to be competed away by peptides unmethylated at K4, but 
not by peptides with a- methylated K4 . 

The set of proteins bound to unmethylated H3 tails is the 
NuRD complex 

The set of proteins, putatively components of a binding 
complex, identified above were purified from Hela nuclear 
extract' over a column of unmethylated H3 peptide. The 
column-purified proteins, once eluted in high salt, were 
rebound to unmethylated peptide in a pull down assay. The 
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rebound proteins were resolved by SDS-PAGE and silver 
stained. This purified source of the putative complex was 
used for mass spectrometry identification of the 
components . 

5 

The sub-units of the complex identified in this way 
include the histone deacetylases HDAC1/2, the ATPase 
chromatin -remodel ling enzyme Mi-20, the RB associated 
proteins Rbap48/46, the metastasis-associated antigens 
10 MTA1/2 and the methyl CpG binding domain MBD3 . These are 
all known components of the NuRD complex (Zhang et al 
(1998) Cell 95(2) 279-289, Zhang et al (1999) Genes Dev. 
13(15) 1924-1935, Xue et al (1998) Mol Cell 2(6) 851- 
861) . 

15 

Several components of the NuRD complex have been 
identified in other deacetylase complexes (Knoepfler, P. 
S. et al (1999) Cell 99, 447-450). The purified complex 
with antibodies against other deacetylase complex 

20 components. Rbap48 protein was detected, whereas DNMT1 
and Sin3 (present in distinct deacetylase complexes: 
Robertson, K. D. et al (2000) Nat Genet 25 (3) , 338-42 . 
22, Zhang, Y et al(1997) Cell 89 (3) , 357-64) , were absent 
from the H3 purified complex . Rbap4 8 is also 

25 present in the CAF-1 (chromatin assembly factor 1) 
complex (Ridgway, P. et al (2000) J Cell Sci 113 {pt 
15) , 2647-58 .) . However a western blot with the p60 
subunit of CAF-1 did not detect CAF-1 in the H3 purified 
complex. 

30 

To establish if NuRD has deacetylase activity when 
affinity purified on unmethylated H3 peptide, peptide 
pull downs were assayed for deacetylase activity . 
Unmethylated H3 peptide was found to associate with a 
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significant level of histone deacetylase (HDAC) activity 
(Figure 6) . In contrast, a peptide methylated at K4 
pulled down background levels of HDAC activity. This 
indicates that the deacetylase activity of the NuRD 
5 complex is not compromised when it is associated with the 
unmethylated histone H3 tail. 

NuRD complex associates with native histone H3 only when 
unmethylated at lysine 4 

10 Previous studies have shown that the NuRD complex can be 
immunoprecipitated using anti-MTA2 antibodies (Zhang et . 
al (1999) supra) . MT A2 antibodies were observed to 
precipitate MBD3 (a component of NuRD) as well as 
histone H3 and H4, as detected by western blot with the 

15 respective antibodies". 

The immunoprecipitated histones were found to be 
efficiently competed away by the unmethylated 
peptide, but not by the K4 methylated peptide. These data 
20 indicate that the NuRD complex associates with 
unmethylated native histone H3 (and interacts 
indirectly with H4 via H3) . 

Methylation Status of K4 of Histone H3 

25 The e amino group of lysine residues may be mono-, di- or 
tri -methylated (me) . Antibodies were raised to 
specifically recognize either' the di-me or tri -me state 
of K4 H3 . An antibody raised against di-me K4 was found 
to recognize H3 from a purified source of histones and "is 

30 effectively competed by a di-me but not a tri -me K4 H3 
peptide. An antibody raised against tri-me K4 was found 
to be specifically competed by the tri-me but not di-me 
K4 H3 peptide. Both antibodies were found to selectively 
recognise histone H3 in a total yeast extract. Neither 
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antibody recognised H3 from a yeast strain containing a 
K4 H3 mutation to alanine (K4A) and neither antibody 
recognised bacterially produced recombinant, unmethylated 
H3 . 

Both Di- and Tri-me K4 antibodies were found to recognise 
histone H3 after it has been methylated by SETI. 
Deletion of SET1 abolishes recognition of histone H3 by 
the Di- and Tri-me K4 antibodies, whereas deletion of 
other S. cerevisiae SET genes (SET2, 3,4, 5 and 6) has 
no effect. 

To establish if the bi- or Tri-me state is linked to gene 
activity, we investigated whether SETI can act as an 
activator of transcription and identified genes 
positively regulated by SETI. 

Transcriptional profiling of yeast deficient in SETI was 
therefore carried out and 200 genes were identified whose 
activity is reduced in the absence of SETI. Figure 7 
shows a list of the top twenty genes whose transcription 
is reduced between 1.6 and 2.1 fold in setlA strain 
relative to WT yeast. Northern blot analysis of one of 
these genes PPH3 , confirms that the mRNA expression of 
this gene is indeed reduced in the setlA strain. This 
indicates that SETI positively regulates transcription. 

We next established whether the SETI • activated genes were 
methylated at K4H3 and identified the methylation state 
of K4. The PPH3 gene, as well as two of the other SET1- 
dependent genes, HAM1 and NUP 170, were investigated. The 
use of Di-me and Tri-me specific antibodies in chromatin 
immunoprecipitation (ChiPs) showed that all three genes 
have Di- and Tri-me K4 H3 associated with their promoter. 
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Since the expression profiling was done from yeast grown 
in rich medium, all three of these genes, and indeed all 
the SET1 -regulated genes in Figure 7, are constitutively 
5 active. 

The K4 H3 methylation status of a set of genes whose 
activity can be regulated was then investigated using 
genes that can be activated by either inositol or 

10 methionine deprivation. When the INOl gene is repressed, 
K4 H3 was found to be Di-me but not Tri-me. However,' when 
the gene is active, K4 H3 Di-me was still observed but 
Tri-me also became apparent. Two other inositol regulated 
genes IN02 and IN04 also showed the presence of Tri-me K4 

15 H3 only when active. 

Analysis of an independent metabolic pathway, where genes 
are actively transcribed in the absence of methionine, 
showed that the MET16 gene has Tri-me K4 H3 only when the 

20 gene is active and Di-me only when repressed. Another 
methionine regulated gene, METI7 , showed no Di-me K4H3, 
either when active or repressed but tri-me K4 H3 appeared 
when the gene became active. The K4 methylation patterns 
we observed at all the genes analysed here were not ■ 

25 restricted to promoter regions but were also observed 
when probes in the coding region were used. 

These results (summarized in Table 2) indicate that the 
trimethyl state of K4 H3 is unambiguously linked to 
30 active transcription. When genes are repressed, tri-me K4 
H3 is not observed. In contrast, Di-me K4 H3 is often 
present in repressed genes but is also present on some 
active genes. This is consistent with previously reported 
findings at the chicken beta-globin locus, where Di-me K4 
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H3 is detected on large euchromatic regions encompassing 
both active and inactive genes (Litt et al Science 293 
2453-5 (2001)). Di-me K4 H3 may determine a 
transcriptionally active state "poised" for stimulation. 

5 

These findings demonstrate that SET1 is an activator of 
transcription and uncover a new level of regulation of K4 
H3 function, by virtue of methylation status. 

10 Methylation of histone H3 by SET polypeptides plays an 
important role in the regulation of transcriptional 
activity and the control of cellular proliferation. 
Manipulation of the SET polypeptide histone 
methyltransf erase activity may therefore be used in the 

15 treatment of conditions associated with cellular 
proliferation, such as cancer. 
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Claims : 

1. An assay method for an agent which modulates the 
interaction of histone H3 with a SET polypeptide, the 
method including: 

(a) bringing into contact a SET polypeptide and a 
histone H3 polypeptide in the* presence of a test 
compound; and, 

(b) determining interaction between the histone H3 
polypeptide and the SET. polypeptide. 

2. An assay method for an agent which modulates the 
histone H3 methylase activity of a SET polypeptide, the 
method including: 

(a) bringing into contact a SET polypeptide and a test" 
compound ; and , 

(b) determining the histone H3 methylase activity of the 
SET polypeptide. 

3 . An assay method according to claim 2 wherein the SET 
polypeptide and the test compound are brought into 
contact in the presence of a histone H3 polypeptide, , and; 
the histone H3 methylase activity is determined by 
determining the presence of one or more methyl groups at 
a lysine* residue of the histone H3 polypeptide. 

4. An assay method according to claim 3 wherein said 
lysine residue is selected from the group consisting of 
lysine 4, lysine 27, lysine 36, lysine 37, lysine 42, 
lysine 56, lysine 64, lysine 79, lysine 115, lysine 121, 
lysine 122 and lysine 125. 

5. An assay method according to claim 3 wherein said 
lysine residue is lysine 4 . 



m 
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6 . An assay method according to claim 5 wherein the SET 
polypeptide is a SET1 polypeptide. 

7. An assay method according to claim 6 wherein the di 
and/or tri methylation of the Lys 4 residue by the SET 1 
polypeptide is determined. 

8 . An assay method according to claim 7 comprising 
determining the transcriptional activity of chromatin 
comprising said histone. 

9. An assay method according to any one of claims 2 to 
4 wherein the SET polypeptide is a SET2 polypeptide and 
methylation of a lysine residue of said histone H3 
polypeptide selected from the group consisting of lysine 
27, lysine 36, lysine 37 , lysine 42, lysine 56, lysine 
64, lysine 79, lysine 115, lysine 121, lysine 122 and 
lysine 125 is determined. 

10. A method according to any one of claims 1 to 9 
comprising identifying and/or obtaining a test compound 
which modulates the histone H3 methylase activity of a 
SET polypeptide. 

11. A method according to any one of claims 1 to 10 
further comprising; testing for a biological function of 
methylated histone H3 . 

12 . A method according to claim 11 wherein said 
biological functiofi is transcriptional activation. 



13. A method according to any one of claims 1 to 10 
further comprising; testing for the absence of a 
biological function of an unmethylated histone H3 . 
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14 . A method according to claim 13 wherein said 
biological function is binding to NuRD complex. 

5- 15. A method according to any one of claims 2 to 14 

comprising isolating and/or purifying the test compound. 

16. A method according to claim 15 comprising 
formulating said compound with a pharmaceutically 

10 acceptable excipient. 

17. A SET polypeptide fragment having histone H3 
methylase activity. 

15 18. A fragment according to claim 17 which is a SET1 

polypeptide fragment which methylates lysine 4 of histone 
H3. 

19. A fragment according to claim 18 wherein said SET1 
20 polypeptide is hSET-lP (human KIAA0339) or hSET-2P (human 

KIAA1076) . 

20. A fragment according to claim 19 wherein said SET 
polypeptide is a SET2 polypeptide which methylates a 

25 lysine residue of the histone H3 core. 

21. A fragment according to claim 20 wherein said SET2 
polypeptide is hSET2-l, hSET2-2 / hSET2-3 or WHSC1. 

30 22. A method of producing a SET polypeptide comprising; 
expressing a SET polypeptide from encoding nucleic acid, 
and; 

determining the methyltransf erase activity of the 
expressed SET polypeptide. 



+ 
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23. A agent which modulates SET polypeptide activity- 
obtained by a method according to any one of claims 1 to 
16. 

24. A pharmaceutical composition comprising an agent 
according to claim 23. 

25. A method of making a pharmaceutical composition 
comprising admixing a substance according to claim 23 
with a pharmaceutical^ acceptable excipient, vehicle or 
carrier, and optionally other ingredients. 

26. - A method of treatment of a disorder of cellular 
proliferation which includes administering to a patient 
an agent according to claim 23. 

27. Use of an agent according to claim 23 in the 
manufacture of a medicament for treatment of a 
proliferative disorder. 

28. An assay method for an agent which modulates the 
interaction of histone H3 with a NuRD repressor, 

the method including: 

(a) bringing into contact one or more components of the 
NuRD repressor complex and a histone H3 polypeptide in 
the presence of a test compound; and, 

(b) determining interaction between the histone H3 
polypeptide and the said one or more components. 

29. A method according to claim 28 wherein the histone 
H3 polypeptide is unmethylated at histone H3 . 

30. A method of identifying transcriptionally active 
chromatin comprising; 
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determining the presence or absence of a 
trimethylated Lys 4 residue in a histone H3 polypeptide 
of said chromatin. 

31. A method according to claim 31 wherein said method 
comprises; contacting said chromatin region with an 
antibody which binds specifically to a histone H3 
polypeptide which is trimethylated at Lys 4 . 
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1: H3(M6) UnmethyJated 

2: H3(l-16) K4+K9 Methylated 

3: H3(l-16) K9 Methylated 

4: H3(M6) K4 Methylated 

5: H4(l-14) Unmethylated 
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